Value-Based Policy Teaching with Active Indirect Elicitation
Authors
Abstract
Many situations arise in which an interested party’s utility is dependent on the actions of an agent; e.g., a teacher is interested in a student learning effectively and a firm is interested in a consumer’s behavior. We consider an environment in which the interested party can provide incentives to affect the agent’s actions but cannot otherwise enforce actions. In value-based policy teaching, we situate this within the framework of sequential decision tasks modeled by Markov Decision Processes, and seek to associate limited rewards with states that induce the agent to follow a policy that maximizes the total expected value of the interested party. We show value-based policy teaching is NP-hard and provide a mixed integer program formulation. Focusing in particular on environments in which the agent’s reward is unknown to the interested party, we provide a method for active indirect elicitation wherein the agent’s reward function is inferred from observations about its response to incentives. Experimental results suggest that we can generally find the optimal incentive provision in a small number of elicitation rounds.
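The core idea of the abstract (attach limited state rewards so that the agent's own optimal policy becomes one the interested party values) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration: the three-state deterministic MDP, the reward vectors, the discount factor, the budget, and the discretized incentive levels. The search is a plain brute-force enumeration, not the mixed integer program the paper describes, and the agent's reward is assumed known, so no elicitation takes place.

```python
import itertools

# Hypothetical toy problem; none of these numbers come from the paper.
GAMMA = 0.9                        # discount factor
N_STATES = 3
ACTIONS = (0, 1)                   # 0 = stay, 1 = advance
NEXT = [(0, 1), (1, 2), (2, 2)]    # NEXT[s][a] = deterministic successor state
AGENT_R = [1.0, 0.0, 0.0]          # agent's own reward: it prefers state 0
PARTY_G = [0.0, 0.0, 2.0]          # interested party's reward: it prefers state 2
BUDGET = 2.0                       # cap on the total incentive across states
LEVELS = [0.5 * i for i in range(5)]  # per-state incentive grid: 0.0 .. 2.0


def agent_policy(reward, iters=500):
    """Greedy policy after value iteration; ties go to the lower action index."""
    v = [0.0] * N_STATES
    for _ in range(iters):
        v = [reward[s] + GAMMA * max(v[NEXT[s][a]] for a in ACTIONS)
             for s in range(N_STATES)]
    return [max(ACTIONS, key=lambda a: v[NEXT[s][a]]) for s in range(N_STATES)]


def party_value(policy, start=0, iters=500):
    """Discounted value to the interested party when the agent follows policy."""
    v = [0.0] * N_STATES
    for _ in range(iters):
        v = [PARTY_G[s] + GAMMA * v[NEXT[s][policy[s]]] for s in range(N_STATES)]
    return v[start]


def best_incentive():
    """Enumerate incentive vectors within budget and keep the best one.

    Candidates are ranked by the interested party's value first and by
    lower total incentive second (the tie-break is an assumption).
    """
    best = None
    for delta in itertools.product(LEVELS, repeat=N_STATES):
        cost = sum(delta)
        if cost > BUDGET:
            continue
        policy = agent_policy([r + d for r, d in zip(AGENT_R, delta)])
        score = (party_value(policy), -cost)
        if best is None or score > best[0]:
            best = (score, delta, policy)
    return best[1], best[2], best[0][0]


if __name__ == "__main__":
    delta, policy, value = best_incentive()
    print("incentive per state:", delta)
    print("induced agent policy:", policy)
    print("interested party value:", round(value, 3))
```

In this toy instance the agent stays in state 0 when no incentive is offered, so the interested party's value is zero; a sufficiently large incentive on state 2 makes advancing optimal for the agent. Enumeration grows exponentially with the number of states, which is consistent with the abstract's NP-hardness result and is why a sketch like this only works on very small problems.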
Similar resources
Enabling Environment Design via Active Indirect Elicitation
Many situations arise in which an interested party wishes to affect the decisions of an agent; e.g., a teacher that seeks to promote particular study habits, a Web 2.0 site that seeks to encourage users to contribute content, or an online retailer that seeks to encourage consumers to write reviews. In the problem of environment design, one assumes an interested party who is able to alter limite...
Developing a Model for the Distance Education System as a Teaching Organization Based on Grounded Theory
Problem and purpose: The purpose of higher education is viewed as a factor in the implementation of economic, social and cultural development programs. The changes, complexities and dynamics that have occurred in the economic, political and social systems in the current era have caused organizations to turn to new methods and solutions for their administration. Therefore, the purpose of this re...
Direct Elicitation of Indirect Preferences
This paper re-examines the discrete choice methods and stated preference elicitation procedures that are commonly used in choice-based conjoint analysis. The aim is to clarify their domains of applicability and provide reliable techniques for data collection and analysis.
The impacts of Direct and Indirect Observation on EFL Iranian Teachers' Teaching Quality Improvement
The present paper sought to examine the effectiveness of less experienced teachers' participation in experienced teachers' classes on students' achievements in terms of their proficiency levels in both Elementary and Pre-intermediate levels. This quasi-experimental design study was conducted in three Language institutes in Tehran, Iran. Twenty-one EFL teachers were selected as experienced and less ...
Knowledge Elicitation for Design Task Sequencing Knowledge
There are many types of knowledge involved in producing a design (the process of specifying a description of an artifact that satisfies a collection of constraints [Brown, 1992]). Of these, one of the most crucial is the design plan: the sequence of steps taken to create the design (or a portion of the design). A number of knowledge elicitation methods can be used to obtain this knowledge from ...
Journal title:
Volume, issue:
Pages: -
Publication date: 2008